Multipass algorithm for acquisition of salient acoustic morphemes

نویسندگان

  • Michael Levit
  • Allen L. Gorin
  • Jeremy H. Wright
چکیده

We are interested in spoken language understanding within the domain of automated telecommunication services. Our current methodology involves training statistical language models from large annotated corpora for recognition and understanding. Since the transcribing of large speech corpora is a resource consuming task, we are motivated to exploit speech without transcriptions. In particular, we learn the semantic associations for a task exploiting only phone-based sequences from the output of a task-independent ASR-system. In this paper we present a new multipass algorithm for acquiring salient phone sequences from untranscribed speech corpora and evaluate their utility for the HMIHY task. Compared to our previous strategy, this algorithm is shown to produce improved call-classification results while reducing up to 7-fold the number of salient phone-sequences selected for training.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accuracy Order of Grammatical Morphemes in Persian EFL Learners: Evidence for and against UG

This study addresses the acquisition of the morphological markers in Persian learners of English as a foreign language. To this end, the accuracy order of nine morphemes including plural –s, progressive –ing, copula be, auxiliary be, irregular past tense, regular past tense –ed, third person –s, possessive -ʼs and indefinite articles was studied in 6...

متن کامل

Errors of Omission and Commission in Verbal and Nominal Inflectional Morphemes by Children with SLI: Phonological Effects and Acoustic Analysis

It has previously been shown that inconsistency in the early morpheme productions of typically developing (TD) children and those with Specific Language Impairment (SLI) can be partly explained by the phonological complexity of the coda. However, it is not yet known whether TD and SLI children have similar underlying processes of morpheme acquisition. Of particular interest is the reported late...

متن کامل

Running head: L1 INFLUENCE ON MORPHEME ACQUISITION ORDER 1 L1 Influence on the Acquisition Order of English Grammatical Morphemes: A Learner Corpus Study

We revisit morpheme studies to evaluate the long-standing claim for a universal order of acquisition. We investigate the L2 acquisition order of six English grammatical morphemes by learners from seven L1 groups across five proficiency levels. Data are drawn from approximately 10,000 written exam scripts from the Cambridge Learner Corpus. The study establishes clear L1 influence on the absolute...

متن کامل

An empirical study of multipass decoding for vietnamese LVCSR

In this paper, we represent an empirical study of multipass decoding for Vietnamese LVCSR. We report our experiments with N-best, lattice and consensus decoding on the VNBN data. Results from this study indicate that our acoustic model for Vietnamese was precise. The results could be investigated in further steps to improve the performance of our system. Index Terms Vietnamese, Acoustic Model, ...

متن کامل

مقایسه روش‌های مختلف یادگیری ماشین در خلاصه‌سازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت

In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognitio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001